Name | Version | Summary | date |
aspose-html-net |
25.7.0 |
Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes. |
2025-07-23 15:49:34 |
pydantic-scrape |
0.1.2 |
A modular web scraping framework using pydantic-ai and pydantic-graph with intelligent caching |
2025-07-23 11:55:16 |
journ4list |
0.7.0 |
A powerful async news content extraction library with modern API for web scraping and article analysis |
2025-07-22 21:08:16 |
scrapy-item-ingest |
0.1.2 |
Scrapy extension for database ingestion with job/spider tracking |
2025-07-22 13:34:03 |
melodic |
1.1.0 |
A Python client for fetching artist lyrical discographies. |
2025-07-21 22:54:44 |
humanization-playwright |
0.1.2 |
A library for human-like interactions in Playwright automation, uses Patchright to avoid bot detection and human-like cursors and typing interactions |
2025-07-21 05:11:00 |
tokopaedi |
0.2.1 |
A Python scraper for Tokopedia that supports filtered product search, detailed product information, and customer reviews with accurate mobile pricing and Jupyter Notebook compatibility. |
2025-07-20 10:35:00 |
pydoll-mcp |
1.5.15 |
Revolutionary Model Context Protocol (MCP) server for PyDoll browser automation with zero-webdriver operation and intelligent captcha bypass |
2025-07-20 09:54:18 |
bitchute-scraper |
1.0.0 |
A modern, API-based package to scrape BitChute platform data. |
2025-07-20 09:08:17 |
juscraper |
0.1.3 |
Raspador de tribunais e outros sistemas relacionados ao poder judiciário. |
2025-07-20 01:31:16 |
web-maestro |
1.0.0 |
Production-ready web content extraction with multi-provider LLM support and intelligent browser automation |
2025-07-15 01:39:53 |
rapid-crawl |
0.1.0 |
A powerful Python SDK for web scraping, crawling, and data extraction - inspired by Firecrawl |
2025-07-11 12:32:22 |
browser-captcha-solver |
1.0.3 |
A Python library for browser-based captcha solving |
2025-07-09 22:00:28 |
DrissionPage-expend |
1.0.1 |
DrissionPage XHR请求扩展库,支持多种数据类型和请求方式 |
2025-07-09 14:43:36 |
drissionpage-xhr-extend |
1.0.0 |
DrissionPage XHR请求扩展库,支持多种数据类型和请求方式 |
2025-07-09 14:21:50 |
googlesearch-tool |
1.1.2 |
A Python library for performing Google searches with support for dynamic query parameters, result deduplication, and custom proxy configuration. |
2025-02-17 10:24:53 |
betterhtmlchunking |
0.9.1 |
A Python library for intelligent HTML segmentation and ROI extraction. It builds a DOM tree from raw HTML and extracts content-rich regions for efficient web scraping and analysis. |
2025-02-14 08:21:28 |
opgg.py |
2.0.3 |
An unofficial Python library for scraping/accessing data from OP.GG |
2025-02-14 05:15:29 |
ambi-alert |
0.0.2 |
This is a reverse search tool. Agentic Alerting |
2025-02-07 23:39:47 |
supadata |
1.0.2 |
The official Python SDK for Supadata - scrape web content and YouTube transcripts with ease |
2025-01-27 14:20:52 |